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ABSTRACT 


The effects on classification accuracies due to changes 
in data resolution are examined in timber stand classifica- 
tions using remotely sensed multispectral data. Such an 
investigation is valuable in deciding on optimal sensor and 
platform designs. Data resolution means the actual ground 
area covered by a pixel recorded on the multispectral data; 
high resolution implies small ground area. 

To date results in the data resolution study indicate 
that classification accuracies for data v/ith high resolution 
are actually less than the accuracies for data with lower 
resolution. This conclusion is supported by theoretical 
justifications and by experimental verification. The 
verification was performed on multispectral data sets over 
the Sam Houston National Forest. 
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1.0 OBJECTIVES 


This data resolution investigation is designed to; 

• Study the theoretical effects on classification 
accuracies due to changes in data resolution 

• Verify the theoretical conclusions by performing 

a forestry classification, using real and simulated 
data with various reduced resolution. 



2*0 JUSTIFICATION 


It is often expected that multispectral data with 
different data resolution permits different classification 
accuracies for varied hierarchies of ground features, This 
follows from photointerpreters • experience in mapping ground 
features, using varied scales of imagery. In this paper, 
data resolution means the actual ground area covered by a 
pixel, or picture element, recorded on the multispectral 
data, e.g., as recorded by the multispectral scanner. 

The question naturally arises; "What is the optimal 
data resolution for classification of remotely sensed data?" 
For specific applications, e.g., forestry applications, the 
same question must also be answered for the specific features 
of interest. In forestry applications, the features are 
timber stands of different species and/or condition classes. 
Species composition defines the timber type of the stand, 
while the age and/or size determines the condition class of 
the timber stand. 

Forest scenes are particularly complex, especially 
when viewed from low altitudes. The complexity is due to 
the nonhomogeneity of the tree patterns, nonuniformity of 
the composition of trees in the stand, variation in the 
undergrowth and spacing between individual trees, texture 
effects due to shadows, etc. All these effects are signifi- 
cant for data with resolution less than perhaps 20 meters 
square. 



Low altitude data, i.e. , high resolution data, is 
sometimes undesirable for machine processing and recognition. 
This is contrary to the belief and practice in photointerpreta- 
tion, The reason is machine processing of multispectral data 
is not completely parallel with the human interpretation pro- 
cess. The latter process actually extract information such as 
texture, shape of tree crowns , shadows of trees indicating 
their profiles, etc., from high resolution data, which are 
used for detection of details, However, the machine process 
has not been advanced to using these nonspectral information 
that photointerpreters uso; the standard training field pattern 
recognition technique of Laboratory for Application of Remote 
.Sensing System (LARSYS) (ref. 1) uses spectral information 
alone, Thus, forest scene complexities in high resolution 
data make stand identification difficult. Smoothing out the 
complexities would be expected to improve classification 
accuracies . 

Guided by this intuition, the complexities in forest 
scenes are smoothed by simulating lower resolution data from 
high resolution data. This is achieved by an averaging 
process, which simulates data sets obtained at higher 
altitudes. With this kind of modeling, the theoretical 
effects on classification accuracies due to resolution 
reduction can be examined, And with this kind of simulation, 
experimental work can be performed to verify the theoretical 
conclusions . 


3,0 SCOPE OF APPLICATIONS 


Since this work is part of the Forestry Applications 
'Project on Timber Resources (PAP/TR) (ref. 2), forest 
scenes over Sam Houston National Forest (SHNF) were examined. 
Mission M230 of the C-130 aircraft was flown over SHNF on 
March 21, 1973 at 10,000 feet altitude; the Bendix 24 channel 
multispectral scanner (MSS/24) onboard the aircraft collected 
MSS data. Eight edits from the MSS/24 coverage were selected 
for study, as discussed in section 2.6 and appendix H of 
reference 3, These are called Edit numbers 3, 6, 9, 12, 14, 

18, 53 and 54. Each edit is approximately 11 square kilometers, 
The forest features of interest are listed in table I, 

The data sets over the eight edits were preprocessed. 
Preprocessing means calibration, scan-angle correction and 
registration to ground (ref. 2 and 4-8). The resulting data 
resolution is approximately 8 meters square. 


TABLE I. - THE ViPG (CLASSES) AND CONDITION CLASSES (SUBCIASSES) 

OF FOREST MATURES OF INTEREST IN SAM HOUSTON NATIONAL FOREST OF TEXAS 


Typo No. 

'Type (Class) 

Subclass No. 

Condition Glass (Subclass) 

X 

Shortleaf pine 

1.1 

Plantation * 3 years old 



1.2 

Poletimber - immature 



1.3 

Sawtimber - immature 



1.4 

Sawtimbcr - mature 

2 

Loblolly pine 

2.1 

Plantation - 1 year old 



2.2 

Plantation * 3 year old 



2.3 

Seedling and Sapling - 




adequately stocked 



2.4 

Poletimber - immature 



2.5 

Sawtimber - immature 



2.6 

Sawtimber - mature 

3 

Laurel oak - 

3.1 

Sawtimber - immature 


willow oak 



4 

Sweetgum - nuttal 

4.1 

Sawtimber - low quality 


oak - willow oak 

4.2 

Sawtimber - immature 



4.3 

Sawtimber - mature 

5 

Post oak - black 

5.3 

Sawtimber - immature 


oak 



6 

Loblolly pine - 

6.1 

Sawtimber - immature 


hardwoods 



7 

Cut -over land 

7.1 

Site prepared and wind rowed 



7.2 

Not site prepared 









4.0 TECHNICAL APPROACH 


4.1 Simulation of Data with Reduced Resolution 

An averaging or weighted averaging relationship is 
assumod between data at different resolutions. 

Por example, data {X} has a resolution of (5 meter) . 

Data {Y} has a reduced resolution, i.e., coarser resolution, 

2 2 
of (10 meter) . Thus, the same (10 meter) ground area 

covered by one Y measurement, y , will be covered by four X 

measurements, Xp x 2 , x 3 , and x^. The averaging relationship 

is 

y « ?.f(x 1 +x 2 +x 3 +x 4 ) . 

The weighted averaging relationship is 

Y m w 1 V w 2 x 2 tw 3VV t 4 E w i “ l > w i* 0 • 

Data set (X) and set {Y} could theoretically be acquired 
using the same sensor, flown at two altitudes; the altitude 
for {X} is half the altitude for {Y}, (See section 2.5 of 
ref. 3.) 


4.2 Classification and Evaluation Procedures 

The classification technique is the widely used scheme 
of supervised pattern recognition, That is, training fields 
are selected to train the maximum likelihood classifier, such 
as in LARSYS (ref. 1) . Normal statistical distributions are 
assumed on the training classes. Equivalently, the Bayes' 
classifier using equal a priori probability is employed 
(ref. 9) . 


The evaluation procedure is the calculation of classi- 
fication accuracies of training fields/classes . The classi- 
fication accuracy is a measure of the statistical probability 
of correct classification (PCC) (ref. 10) , which is another 
widely accepted evaluation parameter. Also, the divergence 
measure (ref. 10) is calculated to convey the extent of 
separability between classes. In special cases, it has been 
established that the divergence measure has direct relation- 
ship with PCC. 


5.0 THEORETICAL RESULTS 


5.1 Probability of Correct Classification 

The following discussion shows that there is a gain in 
the PCC when the data resolution is lowered. Actually, the 
probability of misclassif ication (PMC) for the 2-class 
classification case is computed below; PMC = 1 - PCC . 

Data sets {X} and {Y} are studied, where (Y) is a mX 
(i.e. m times) reduction of {X} ; i.e,, a generic data 

9 

point in {Y} is an average of m data points in (X) . 

Assume the following notations for the means and 
covariance matrices for the two classes C^ and C 2 in the 
data sets { X } and (Y) *. 


C l : y Xl,^xi; y Yl,2 Y i 

C 2 * y X2,^X2; y Y2,^Y2 . 

These parameters can be estimated from the {X} and (Y) data 
sets, using the normal method of training field selection 
and statistics calculation. By the averaging process which 
simulates {Y} from {X} , it can be shown that: 

y Xl = y Yl 

2 X i = m ^Y1 


y X2 = y Y2 





Using these statistics to train the classifier, the Bayes' 
regions (ref. 10) for equal a priori probabilities are 
established and denoted by R X1 , R x2 £° r data set {X} and 
by Ry^ Ry 2 for data set {Y} . It can be shown that 




R X2 = R Y2 


By the definition of PMC, which is (ref, 10) 


PMC = i 5 Prob(R 1 /C 2 ) + JjProb(R 2 /C 1 ) , 


it can be seen that 


(PMC) X > (PMC)y 

That is, PMC is lower for data set { Y } than for {X} , 
because the distributions in data set {Y} taper off quicker 
than in {X } ; this is a result of the relationship between 
the covariance matrices. Therefore, 

(PCC)y > (PCC) X . 

That is, the classification accuracy will be higher for the 
lower resolution data {Y} than for the higher resolution 
data {X> . 


cjt 2 


5.2 Separability: Divergence Measure 


The following establishes that the divergence between 
and C 2 'increases with the lowering of the data resolution; 
the same situation as in section 5.1 is assumed. The diver- 
gence measure is used because it has been shown (ref. 11) that 
the divergence value relates to PCC. In fact, when ~ X«2 

for and C 2 , the divergence J(C^,C 2 ) between and C 2 
has a one-to-one relationship with PCC; and, J(C^,C 2 ) 
increases if and only if PCC increases. Generally, the larger 
the divergence value, the more separable C^ is from C 2 , 


as • 


The divergence J(C^,C 2 ) between C. and C 2 is defined 

J(C l -,C 2 ) - %tr ^5^2 "^1 J 

+ . 5 [, 1 -M 2 ] T [E 1 - 1 ^ 2 - 1 ]K-y 2 ] . 


By the relationship established in section 5.1 between 

l^Xi and Pyi’ X*xi an< ^ ^Yl’ ^X2 an< * ^Y2’ 2^X2 an< ^ £y2’ 
J x (Ci,C 2 ) and J y (C 1 ,C 2 ) for data sets (X) and {Y} can be 

related by the following inequalities: 

Jx^ C i*C 2 ) £ dy(C^,C 2 ) £ m J X (C 1 ,C 2 ) • 

J y =m 2 Jx in the special case when » ancl w ^ en 

there is no averaging, i.e., when data set { Y } is identical 
to data set {X}. 


In general, when {Y} is a weighted average of {X} ; i.e., 
a generic data point, y , in { Y) relates to the generic data 
points, , in {X} in the following manner: 


y = 


m 

E 

i=l 


w.x. 
1 1 


E w i =1 


w. >0 

l— 


2 

the m factors in the above discussion will be replaced by 

2 2 
m times a constant. This constant equals 2^w. , and takes 

2 1 
values between 1 and 1/m . That is, 


where 1 < k < m 


J x ( c i, c 2 ) £ J y (C r C 2 ) < k J X (C 1 ,C 2 ) 
2 


6.0 FORESTRY APPLICATIONS: TO DATE RESULTS 


6.1 Data Set Studied: Edit 9 

During this reporting period, only Edit 9 out of the 
8 edits was investigated. A three-channel color rendition 
of the MSS data is shown in figure 1, with the timber stand 
and compartment boundaries delineated on the imagery. The 
codes for timber stand types are found in table I of 
section 3.0. 

The unreduced data plus two simulated data sets were 

studied: IX, 2X, and 3X, where IX has a data resolution of 

2 2 

approximately (8 meters) ; 2X, C16 meters) ; and 3X, (24 
2 

meters) . The simulation is by the data resolution reduction 
program discussed in section 2.5 of reference 3. 

Only 12 channels on the MSS/24 were operating during 
M230. They are the channels numbered 1 through 11 and 13 
on the scanner. However, in Edit 9 data, these channels 
are numbered 1 through 12. The spectral coverages of these 
channels are shown in table II. 

6.2 Field Selection 

The fields selected for classification and divergence 
studies are shown in figure 2. The entire Edit 9 area is 
divided into 3 sections, left (L) , middle (M) and right (R) ; 
hence the labels of fields, e.g., L2.5, R2.5. 
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Figure 1. — Edit numbir 9. 
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TABLE IX, 


SPECTRAL COVERAGES OF 12 CHANNELS OF 
THE EDIT 9 MSS DATA 


Channel No. 

Spectral coverage (micrometer) 

1 

0.375-0,405 

2 

0.40-0.44 

3 

0.466-0,495 

4 

0.53-0.58 

5 

0.588-0.643 

6 

0.65-0.69 

7 

0.72-0.76 

8 

0.770-0,810 

9 

0.82-0.88 

10 

0.981-1.045 

11 

1.20-1,30 

12 

2.10-2.36 












Because it was felt that the scan-angle correction 
performed earlier in the project did not adequately correct 
for the scan- angle effects, the left fields, middle fields 
and right fields were studied separately. Actually, it was 
found that the mean values for 2,5 fields in the left differ 
by a few counts from the 2.5 fields in the right; hence 
the conclusion on inadequate scan-angle correction. 

The same physical fields were selected from IX, 2X, and 
3X. Thus, the field coordinates in IX, 2X, and 3X are directly 
related. 


6.3 Data Processing 

The IX, 2X, and 3X data were processed on the Earth 
Resources Interactive Processing System (GRIPS) . The left 
four fields, middle two fields, and right three fields were 
studied separately. 

Statistics of these fields were generated; pairwise 
classification and divergence calculations were made. For 
example, for the right three fields R1.3, R2.3 and R2.5, there 
are three pairs: R1.3/R2.3, R1.3/R2.5 and R2.3/R2.5. 

Classification and divergence calculations were performed 
using three different channel sets. (1) 8 channels: numbers 

2, 4, 5, 7, 8, 9, 10, and 11. (2) 4 channels: numbers 3, 5, 

8; and 11, (3) 4 best channels as dictated by the channel 

selection processor on ERIPS: numbers 2, 7, 10, 12 for IX 

and 3X; numbers 2, 3, 7, and 11 for 2X. 




In case 1, the 8 -channel set was chosen arbitrarily 
because of the limitation of ERIPS in the divergence 
calculation. Channels 1 and 3 were arbitrarily dropped, 
because channel 2 contains very similar information (at 
least visually); channel 2 was retained; channel 6 was 
dropped because of data drop-out; channel 12 was dropped 
because data values were very low. In case 2, the 4-channel 
set was arbitrarily chosen, and spaced throughout the 12 
channels, The channel set in case 3 was dictated by the 
channel selection processor on ERIPS. 

6.4 Analysis Results 

The results of performing the classification and 
divergence measurements are summarized in figures 3 through 
5. Each figure is in bar-chart form, 

Each bar- chart shows the classification accuracy for 
the pairwise classification (on the ordinate) versus the 
specific pairs of classes used in classification (on the 
abscissa) . The classification accuracy is a measure of 
PCC and is given by J$(classif ication accuracy of + 
classification accuracy of C 2 ) * J classification accuracy of 
C- = the number of points of C. correctly classified into 

X X 

C^/the number of points of C^. 


*Beside this definition of classification accuracy, 
other measures have also been commonly used; for example: 
number of correctly classified points of and C 2 /total 
number of points of and Cg* 
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The pairwise divergence values are also indicated in 
the bar-charts. The vrlues are written in the bars. 

Figures 3 through S correspond to cases 1, 2, and 3 
of data processing discussed in section 6.3. The IX, 2X, * 

and 3X results are shown side-by-side. 

,6.5 Inference from Analysis Results 

Figures 3 through 5 lead to the following conclusion’ 
Classification accuracies increase With the lowering of data 
resolution. Also, classes are more separable at lower 
resolution. 

This conclusion reinforces the theory discussed in 
section 5.0. 





7.0 THE PARADOX AND SOME REMARKS 


7.1 The Paradox 

The theoretical and experimental results conclude that 
classification accuracies increase with the reduction in 
fidelity of data resolution. This gives rise to the 
following paradox: 

"If ground features can be classified 
using low altitude data, they can also 
be classified, and even with higher 
accuracies, using high altitude data." 

7.2 The Paradox's Intuitive Explanation 

The paradox should not cause any alarm, because the 
statement is asserted for classification accuracies alone, 
and because the classification technique employs spectral 
information alone. 

The accuracy measure used in the analysis comes from 
evaluating training/test data which are well defined and 
delineated due to prior knowledge. The loss in boundary 
accuracy and mensuration accuracy in the analysis of higher 
altitude data has not been accounted for. These two factors 
are most often deciding factors on optimal data resolution. 
Also, the gain in details at higher resolution is not an 
asset to the spectral classification rule. In fact, the 
details in texture, etc. , add to the complexity in machine 
processing in this case. 


Another explanation for the increase in classification 
accuracies for lower resolution data is that a M PERFIELD M 
classification (ref. 11) is performed on the lower resolution 
data, compared to a "PERPOINT" classification on the higher 
resolution data. A "PERFIELD" classification rule has been 
suggested to be superior to the "PERPOINT" classification 
rule, That is, nonhomogeneity in the higher resolution data 
is reduced by the averaging process, which gives the lower 
resolution data. This explanation is readily acceptable, 
especially for forest scenes, where complexities abound 
with high resolution. 


7.3 A Remark on Detection 

An interesting remark follows from the conclusion of 
the above analysis. For detection purposes, ERTS-1 , for 
example, will outperform aircraft data analysis, as long as 
the features to be detected have physical sizes larger than 
the ERTS-1 resolution (preferably at least four times larger, 
in order to assure total containment of the feature in at 
least one pixel). Detection here means the detection of 
the presence of the feature, disregarding its size. 

7.4 Decision on Optimal Data Resolution 

An optimizing criterion can be set up where the optimal 
choice of data resolution is a compromise between classifica- 
tion accuracy, boundary accuracy and mensuration accuracy. 

The criterion, D, could then be written as 


D = 


d., 0 + d o 8, + 

1 c 2 b 



d,0 , 

3 m 


where d^, d 2 > and ^3 are weights in the criterion, and 
0,0., 0 are respectively the accuracies in classification, 
boundary location, and mensuration. An optimal solution for 
data resolution will be obtained by achieving maximum value 
of the criterion D . Different applications will call 
for different weights d^, d 2 , and d^; and will produce 
different solutions. Other factors such as the cost of data 
acquisition, cost of data processing, etc., can be also 
incorporated into the criterion as follows: 

D' = d x 0 c + d 2 0 b + d 3 0 m + d 4 C a + d 5 C p , 

where C and C are the respective costs, 
a p 

An optimal decision on data resolution will lead to an 
optimal design of sensors and platforms. 




8 . 0 CONCLUSIONS 


This study has derived the theoretical effects on 
classification accuracies due to the changes in data 
resolution. The modeling of the data reduction scheme 
led to simulation of forestry data at lower resolutions. 

The performance of the forestry applications experiment 
led to analysis results that reinforce the theoretical 
inferences. That is, classification accuracies increase 
with the reduction in fidelity of data resolution using 
the training field and maximum likelihood classification 
scheme. 

This apparent paradox is explained in section 7.0. 
Suggestions are then made concerning the optimal design of 
data resolutions, and thus the optimal design of sensors 
and platforms. 

Further efforts in this data resolution study will \)e 
spent in: (1) Extension of the results and analyses to 

other areas in the Sam Houston National Forest; (2) Expansion 
of the results and analyses to obtain classification images; 
(3) Quantitative and qualitative evaluation of the resulting 
classification images. 
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